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ABSTRACT 

The overall performance of content distribution networks as 
well as recently proposed information-centric networks rely 
on both memory and bandwidth capacities. In this frame- 
work, the hit ratio is the key performance indicator which 
captures the bandwidth / memory tradeoff for a given global 
performance. 

This paper focuses on the estimation of the hit ratio in a 
network of caches that employ the Random replacement 
policy. Assuming that requests are independent and iden- 
tically distributed, general expressions of miss probabilities 
for a single Random cache are provided as well as exact re- 
sults for specific popularity distributions. Moreover, for any 
Zipf popularity distribution with exponent a > 1, we obtain 
asymptotic equivalents for the miss probability in the case 
of large cache size. 

We extend the analysis to networks of Random caches, when 
the topology is either a line or a homogeneous tree. In that 
case, approximations for miss probabilities across the net- 
work are derived by assuming that miss events at any node 
occur independently in time; the obtained results are com- 
pared to the same network using the Least-Recently-Used 
discipline, already addressed in the literature. We further 
analyze the case of a mixed tandem cache network where the 
two nodes employ either Random or Least-Recently-Used 
policies. In all scenarios, asymptotic formulas and approxi- 
mations are extensively compared to simulations and shown 
to perform very well. Finally, our results enable us to pro- 
pose recommendations for cache replacement disciplines in 
a network dedicated to content distribution. These results 
also hold for a cache using the First-In-First-Out policy. 

Categories and Subject Descriptors 

C.2.1 [Computer-Communication Networks]: Network 
Architecture and Design — Packet-switching networks 

General Terms 

Performance Evaluation, Caching Networks 
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Content Distribution Networks, Information-Centric Net- 
working, Cache Replacement Policies, Asymptotic Analysis 

1. INTRODUCTION 

Communication networks use an ever increasing amount of 
data storage to cache information in transit from a source to 
a destination. Data caching is an auxiliary function, where 
a given piece of data is temporarily stored into a memory. 
The cache might then be queried at any time to provide 
an object that is possibly stored in that memory. Caches 
are typically located downstream a bandwidth bottleneck, 
e.g., a communication link with limited bandwidth or a 
shared bus in a network of chips. This storage then allows 
to increase the throughput of the data path and to decrease 
the rate of congestion events. 

Many communication networks fall into such a model: the 
Internet, Content Delivery Networks (CDNs), Peer-To-Peer 
(P2P) as well as networks on chips. In fact, the increasing 
amount of content delivered to Internet users has pushed 
the use of Web caching into communication models based 
on distributed caching such as CDNs. New information- 
centric network architectures [17[ |22| [25] have been recently 
proposed in order to have built-in network storage as a fun- 
damental element of the underlying communication model. 
Content storage becomes a fundamental resource in such 
networks, aiming at minimizing content delivery time under 
an ever increasing demand that cannot simply be satisfied 
by increasing link bandwidth. Moreover, the use of network 
storage, enabling one to cache content in order to bypass 
bandwidth bottlenecks, appears cost effective as memory 
turns out to be cheaper than transmission capacity. 

One of the fundamental operations of a cache is defined by 
its replacement policy which determines the object to be re- 
moved from the cache when the latter is full. Many replace- 
ment policies are based on content popularity, with signifi- 
cant cost for managing the sorted lists. This is the case, in 
particular, for the Least Frequently Used (LFU) policy and 
more sophisticated variants based on it. On the contrary. 
Most Recently Used (MRU), Least Recently Used (LRU), 
First-In-First-Out (FIFO) and Random (RND) policies have 
the compelling feature to replace cached objects with con- 
stant delay 0(1). In-network storage, as envisaged in the 
new architectures mentioned above, may require packet-level 
caching at line rate; current high-speed routers running com- 
plex replacement policies might not, however, sustain such 



high line rates [26]. In this framework, the RND and FIFO 
poHcies can therefore be seen as presenting the least possible 
complexity. In fact, RND and FIFO requires less memory 
access per packet than LRU or MRU, and it has been shown 
[26| that this advantage is critical for sustaining high-speed 
caching with current memory technology. 

In this paper, we address performance issues of caching net- 
works running the RND replacement policy. We mainly fo- 
cus on the analytical characterization of the miss probabil- 
ities under the Independent Request Model (IRM) assump- 
tions when the number of available objects is infinite. We 
first provide exact formula for the miss rate in the case of a 
light-tailed distribution content popularity, namely geomet- 
ric, and for specific Zipf popularity distributions. Proposi- 
tion |3.9| then proves that when the popularity distribution 
follows a general power-law with decay exponent a > 1, the 
miss probability is asymptotic to ApaC^'" for large cache 
size C, where constants A and pa depend on a only. In 
Proposition |3. 10} we extend that result to miss probabilities 
conditioned by the object popularity rank. 

A second major contribution of the paper is given by Propo- 
sition |5.1[ where we evaluate the performance of network of 
caches under the RND policy, for both linear and homo- 
geneous tree networks and asymptotically Zipf popularity 
distributions. An approximate closed formula for the miss 
probability across the network is provided and compared to 
corresponding estimates for LRU cache networks. The anal- 
ysis is also extended to the mixed tandem cache network 
where one cache employs LRU and the other one uses RND. 
Note that the choice of FIFO replacement policy instead of 
RND does not impact any result. 

The specific focus on Zipf distributions, or more generally 
power-law distributions, made in this paper is motivated 
by numerous studies on Internet object popularity, start- 
ing from the late 90's experiments on World Wide Web 
documents (p],[3]) , the content stored in enterprises me- 
dia servers ( [9|,[11| ) and to recent studies on Internet media 
content ([t], [24| yT^ While other content popularity distribu- 
tions might be considered, we do not provide here a complete 
review of the literature on Internet content popularity char- 
acterization; the above references confirm the pertinence of 
Zipf distributions for studying caching performance. 

The remainder of the paper is organized as follows. Section 
[2]presents related work on the analytical performance evalu- 
ation of caching systems. Section [3] analyzes the RND cache 
replacement policy and its comparison to LRU for a single 
cache; these analytical results are compared with exact nu- 
merical and simulation results in Sectionjl] Section|5]reports 
the approximate analysis of the network of RND caches for 
two topologies, namely the line and the tree. Numerical and 
simulation results for the network case are reported in Sec- 
tion [6] Section [T] further evaluates the tandem cache system 
with one LRU and one RND cache, using numerical and 
simulation results. Section [s] concludes the paper (several 
proofs are detailed in the Appendix). 

2. RELATED WORK 

There is a significant body of work on caching systems and 
their associated replacement policies that we will not at- 



tempt to review thoroughly. Here we only report the lit- 
erature focusing on the analytical characterization of the 
performance of such systems. 

The most frequently analyzed replacement policy is LRU 
whose performance is evaluated considering the move-to- 
front rule, consisting in putting the latest requested object 
in front of a list; a miss event for a LRU cache with finite 
size takes place when the position (also referred to as search 
cost) of an object in the list is larger than that size. Under 
the Independent Request Model, [2^ calculates the expected 
search cost and its variance for finite lists. An explicit for- 
mula is given in [4] [l6] for the probability distribution of 
that cost; such a formula is, however, impractical for nu- 
merical evaluation in case of large object population and 
large cache size. Integral representations obtained in [13[ 
|14| using the Laplace transform of the search cost function 
reduce the problem to numerical integration. 

An asymptotic analysis of LRU miss probabilities [Ts] for 
Zipf and WeibuU popularity distributions provides simple 
closed formulas. Extensions to correlated requests are ob- 
tained in |10[ |20| , showing that short term correlation does 
not impact the asymptotic results derived in [Ts] ; the case of 
variable object sizes is also considered in [2l]. The analytical 
evaluation of the RND policy has been first initiated by 15 
for a single cache where the miss probability is given by a 
general expression. To the best of our knowledge, its appli- 
cation to specific popularity distributions has, however, not 
yet been envisaged together with its numerical tractability 
for large object population and cache size. 

We are aware of few papers that address the issue of net- 
works of caches. A network of LRU caches has been analyzed 
by using the approximation for the miss probability at 
each node obtained in 
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and assuming that the output 
of a LRU cache is also IRM; miss probabilities can then be 
obtained as the solution of an iterative algorithm that is 
proved to converge. The results of 18 are extended in [6] 
to a two-level request process where objects are segmented 
into packets, when assuming that the LRU policy applies 
to packets. The analysis is applied to line and tree topolo- 
gies with in-path caching. Moreover, [s] extends ^ when 
network links have finite bandwidth. 

3. SINGLE CACHE MODEL 

In this section, we address the single cache system with RND 
replacement policy, deriving analytic expressions of the miss 
probability together with asymptotics for large cache size. 
To avoid technical results at first reading, the reader may 
quickly go through the notation of Section 3.1 and directly 
skip to main results given in Propositions |3.9| and |3. 10] 

3.1 Basic results 

Consider a cache memory with finite size which is offered 
requests for objects. If the cache size is attained and a re- 
quest for an object cannot be satisfied (corresponding to a 
miss event), the requested object is fetched from the repos- 
itory server and cached locally at the expense of replacing 
some other object in the cache. The object replacement pol- 
icy is assumed to follow the RND discipline, i.e. whenever a 
miss occurs, the object to be replaced is chosen at random, 
uniformly among the objects present in cache. 



We consider a discrete time system: at any time t £ N, 
the t-th requested object requested from the cache is de- 
noted by R{t) £ {1,2, A*'}, where A'^ is the total number 
of objects which can be possibly requested (in the present 
analysis, the set of all possible documents is considered to 
be invariant in time). We assume that all objects are 
ordered with decreasing popularity, the probability that ob- 
ject r is requested being Qr, 1 < r < N. In the following, 
we consider the Independent Reference Model, where vari- 
ables R{t), t G N, are mutually independent and identically 
distributed with common distribution defined by 

¥{R = r) = qr, 1 < r < N. 



The numerator of ([3| can in turn be expressed as 

l<jl<...<jC<N r^{ji,...,jc} 

i<ii<...<jc<N 

X I Y 1r + ■■■ + Y 1r+ Y 1''] 

\^l<r<ji jc-l<r<jc jc<r<N j 

= (C + 1) Y grg,i...g,c = (C + 1)G(C+1) 

l<r<ji<...<jc<-'V 

and expression ^ of M(C) follows □ 



Let C < be the cache capacity and denote by Mc the 
set of all ordered subsets {ji,---,ic} with 1 < jfc < A'^, 
k £ {1,...,C}, and jk < je for k < £. Define the cache 
state at time t G N by the vector S{t); S{t) may equal any 
configuration s = {ji, jc} £ A/c- In the following, we let 



G{C) 



Y *i 



■lie 



(1) 



with G(0) — 1. Let M(C) finally denote the stationary miss 
probability calculated over all possibly requested objects. 
It is shown [lE] that the cache configurations (S(f))tGN de- 
fine a reversible Markov process with stationary probability 
distribution given by 



G JVn 



(2) 



moreover ([15], Theorem 4), probability M(C) equals 



E 



<lii-<ljc 



Y 



M{C) 



^jl,■■■Jc}e^J'c 



Y Ill-lie 

{.;i,---,Jc}6A/'c 



(3) 



In 15 , it is also shown that the cache configuration sta- 



tionary probability distribution and miss rate probability 
are identical in the case of a FIFO cache, hence our results 
apply for the FIFO policy as well. 

Expression ^ can be actually written in terms of normaliz- 
ing constants G{C) and G{C +1) only; this will give formula 
([3| a compact form suitable for the derivation of both exact 
and asymptotic expressions for M(C). 



The latter results readily extend to the case when the total 
number A' of objects is infinite, since the series Ej>igj is 
finite. The calculation of coefficients G{C), < C < N, is 
now performed through their associated generating function 
F defined by 

F{z)= Y G(C)^^, z&C, 

0<C<N 

for eith er fin ite or infinite population size A' (as M{C) < 1, 
Lemma[3T] entails that G(C-h 1)/G(C) < 1/(G + 1) and the 



ratio test implies that the power series defining F{z) has 
infinite convergence radius). We easily obtain the second 
preliminary result. 

Lemma 3.2. The generating function F is given by 

F{z)= n i^ + lr^) (5) 

l<r<JV 

for all z £ C. 

Proof. Expanding the latter product and using defini- 
tion Q readily provide the result □ 

To further study the single cache properties, let Af, (G) de- 
note the per-object miss probability, given that the requested 
object is precisely r G {1, ...,7V}. Defining 

G.(G)= Y (6) 

l<il<.. .<ic<iV, r^{ji....,jc) 

with Gr{0) = 1, we then have Mr{C) = P(r ^ S) so that Q 
and ([6| yield 

Gr{C) 



Mr{C) = 



G(G) 



(7) 



Lemma 3.1. The miss rate M{C) is given by 

G(G+ 1) 



M(G) = (C-f- 1)- 



G(G) 



(4) 



with G{C) defined m (Tip 



Proof. Any state s — {ji, jc} G A/c corresponding 
to a unique sequence 1 < ji < J2 < ... < ic < A'' with 
1 < j* < TV, the denominator of (|3| therefore equals G(G). 



Lemma 3.3. The per-object miss probability MriC) for 
given r G {1, A''} can be expressed as 



Mr(C) = 1+Yi-1- 



G{cy 



(8) 



The stationary probability qr{'2), r G {1, A'^}, that a mtss 
event occurs for object r is given by 



where M{C) is the averaged miss probability. 



(9) 



Proof. By definition ([6|, the generating function Fr{z) 
of coefficients G'r(C), < C < A'^, is given by 



(10) 



Fr{z)^-^^, zee. 

1 + qrZ 

Expanding the latter ratio as a powers series of z gives 

Gr{C) = J2(-1rf''G{£) 



and provides ([sjl after using definition Q for Mr{C). Be- 
sides, letting A4 denote a miss event, Bayes formula entails 



¥{M) 



M{C) 



hence relation ([9| □ 



If the popularity distribution has unbounded support, then 
limr^+oo qr ~ and formula Q implies that the per-object 
probability Mr{C) tends to 1 as r t +oo for fixed C; ([9| 
consequently entails 



qr{2) 



qr 



M{C) ' 



r t +0O. 



(11) 



For given C, asymptotic ( |11[ ) shows that the tail of distri- 
bution (gr(2))reN at infinity is proportional to that of dis- 
tribution (qr)rgN- The distribution (gr(2))r6N describes the 
output process of the single cache generated by consecutive 
missed requests; it will serve as an essential ingredient to 
the further extension of the single cache model to network 
cache configurations considered in Section [5] 

3.2 First applications 

Coefficients G(C), C > 0, and associated miss probability 
M{C) can be explicitly derived for some specific popularity 
distributions. In the following, the total population A'' of 
objects is always assumed to be infinite. 



Corollary 3.4. Assume a geometric popularity distri- 
bution qr = [1 — hi)K,^~^ , r > 1, with given k £ ]0, 1[. For all 
C > 0, the miss rate equals 



M{C) 



<c+ 



(12) 



Proof. Using Lemma |3.2[ F is readily shown to verify 
the functional identity F{z) = (1 + (1 — h)z)F{kz) for all 
2 G C. Expanding each side of that identity in power series 
of z and identifying identical powers p rovi des the value of 
the ratio G(C + 1)/G(C), hence result ([l2]| by Q □ 



Let us now assume that the popularity distribution follows 
a Zipf distribution defined by 



qr = 



A 



r- > 1, 



(13) 



with exponent a > 1 and normalization constant A = l/^(a), 
where C, is the Riemann's Zeta function. We now show how 
explicit rational expressions for miss rate M{C) can be ob- 
tained for some integer values of a. 



Corollary 3.5. Assume a Zipf popularity distribution 
with exponent a. For all C > 0, the miss probability equals 

3 



M(C) = 



2(7 + 3 



45 



(4C + 5)(4C' + 3)(2C + 3) 
9! (C + 1) 
3! n (6G + j) 

4<j<9 



if a = 2, 
if a = 4, 

if a = 6, 



Proof. When a = 2, the normalization constant equals 
A — 1/C(2) ~ 6/iT^. From the infinite product formula [l] 

^, , TT /, \ sinhit 

and expanding the left hand side into powers of = Azn^ 
gives the expansion F(z) = T,c>iG{C)z^ where 

G(C) = {n^Af/{2C + 1)!, G > 0. 

Computing then ratio Q with the above expression of GiC) 
then provides M[C) = 3/(2G + 3), as claimed. The cases 
when Q = 4 or ci = 6 follow a similar derivation pattern □ 



The formulas of Corollary |3.5| do not seem, however, to gen- 
eralize for integer values a = 2p with p > 4; upper bounds 
can be envisaged and are the object of further study. 



As also suggested by Corollary |3.5| the cache size corre- 
sponding to a target miss probability should be a decreasing 
function of a. This property is generalized in Section |3.3| 
where an asymptotic evaluation of M{C) is provided for 
large cache size C and any Zipf popularity distribution with 
real exponent a > 1. 

3.3 Large cache approximation 

The specific popularity distributions considered in Corollar- 
ies [3!4] and [3^ show that M{C) is of order Cqc for G. In the 
present section, we derive general asymptotics for probabil- 
ities M{C) and Mr{C) for large cache size and apply them 
to the Zipf popularity distribution. 

We first start by formulating a general large deviations result 
(Theorem |3.6[ ) for evaluating coefficients G(G) for large G. 
To apply the latter theorem to the Zipf distribution ( 13]), we 



then state two preliminary results (Lemmas 3.7 and 



3.81 on 



the behavior of the corresponding generating function F at 
infinity. This finally enables us to claim our central result 
(Proposition |3.9[l for the behavior of M(G) for large G. 



Theorem 3.6. (See Proof in Appendix\A^ 

(i) Given the generating function F defined in equation 

zF'{z) = CF{z) (14) 

has a unique real positive solution z = 9c for any given 
C > 0; 

(ii) Assume that there exists some constant a > such that 
the limit 



,. sVc Fjece-'/^) ,2^2/2 
lim e — ^ ; — r — e ' 



(15) 



holds for any given s £ C with K(s) = and that, given any 
(5 > 0, there exists rj g]0, 1[ and an integer Cs such that 



sup 

S<\y\ <TT 



F{ec) 



i/c 



(16) 



for C > Cs- We then have 

G{C)r 



exp{Hc 



as C tends to infinity, with He = \ogF{9c) — Clog 6c 



(17) 



Following Theorem |3.6[ the asymptotic behavior of AI{C) 
can then be derived from (17 1 together with identity Q. 
This approach is now applied to the Zipf popularity distri- 
bution 1 13 1; in this aim, the behavior of the corresponding 



generating function F is first specified as follows. 



so that Hc+i — He = — alogC + k + o(l) with constant 
k = log Pa + log A. Coming back to definition Q of M(C), 
the latter estimates enable us to derive that 

M{C) = (C7 + 1)^^^ ~ CeMHc+i - He] 



= Pc^C qe 



(22) 



as claimed □ 



Remark 3.1. Proposition provides asymptotic l[21]/ 
for M(C) under the weaker assumption that the popular- 
ity distribution q,., r > 1, has a heavy tail of order r~°' for 
large r and some a > 1, without being precisely Zipf as in 
JTm. In fact, all necessary properties for deriving Lemmas 
\3.7\and\3.8\ are based on that tail behavior only. 



Lemma 3.7. (See Proof in Appendix\B\l 

For Q > 1 and large z £ C \ R~, log F{z) expands as 

log F{z) = aip^Azf^" - ^ \og{Az) + + o(l) (18) 

with A — l/(^{a) and Sa depending on a only and 

■K I a. 



pa 



sin(7r/a) 



(19) 



Lemma 3.8. (See Proof in Appendix^D^ 

For a > 1 and large C , the unique real positive solution 9e 
of equation UjV verifies 



0e = ^+ C"-\e 

Apa 



(20) 



with re = Ai + 0{C ^) with Ai = a/2paA i/ a / 2 and 
re = O(logC) i/a = 2. 



We can now state our central result. 



Proposition 3.9. For a Zipf popularity distribution with 
exponent Q > 1, the miss probability M{C) is asymptotic to 



M{C) ~ paCqe 



Apa 
(Ja- 



(21) 



for large C , with prefactor pa given in f^H 
Prefactor pa verifies HmQ-t-+oo Pa ~ ^ and pa ~ l/(c* ~ 1) is 
a 4- 1- 

Proof. As verified in Appendix [D] the conditions of 
are satisfied for a Zipf distribution. Using 
T8| and pOl of LemmaslS^andlS^to exphcit 



Theorem 13.6 



asymptotics 



the argument He in (171 for large C, we then have 



He = log Fide) -Clog Oe 



a{paAOe) 
C" 



logiAOe) + Sa + o(l)- 



Clog 



Apa 



+ C re 



To close this section, we now address the asymptotic behav- 
ior of Mr{C) defined in (0. 



Proposition 3.10. For any Zipf popularity distribution 
with exponent a > 1 and given the object rank r, the per- 
object miss probability M,.(C) is estimated by 



Mr{C) 



Par" 



C" + Par" 

for large C , with prefactor pa defined in \1S\ 



(23) 



Proof. The generating function Fr of the sequence GriC), 
C > 0, being given by ( |10[ ), apply Theorem |3.6| to estimate 
coefficients Gr{C) for large C. Concerning condition (i), 
the solution rj — ric to equation r]F^{ri) — CFr{rf) reduces 
to equation ( 14 1 for — 9e where the term qr0/{l + qr6) has 
been suppressed; but suppressing that term does not modify 
the estimate for 6e with large C so that rje ^ 9e- On the 
other hand, condition (ii) is readily verified by generating 
function F^ and we then obtain 



Gr{C) 



G{C) 
1 + qrOc ' 



(24) 



By Lemma 3.8 we have 9e ~ C"" /Apa for large C; definition 
^ of Mr{C) and estimate (|24| with qr = A/r"" give 



Mr(C) 



Gr{C) _ 1 

G(C) ~ 1 + qr9e C« -I- Par'' 



par 



(25) 



and result ( 23 1 follows □ 



For any value a > I, ( 23 1 is consistent with the fact that 



MriC) is an increasing function of object rank r and a de- 
creasing function of cache size C. 

3.4 Comparing RND to LRU 

Let us now compare the latter results with the LRU replace- 
ment policy investigated in |14[ |18|. Recall that, for a Zipf 
popularity distribution with exponent a > 1, the miss prob- 
ability M(C) for LRU is estimated by 

M(C) ~ \aCqc 



for large (7, with prefactor 



1 












a 







(26) 



where V is Gamma function (flW, Theorem 3). is esti- 
mated by 



as a t +00 and a ^ 1, respectively (7 is Euler's constant 
and e"' ~ 1,781...). In view of Proposition |3.9[ comparing 
asymptotics for coefficients A^ and pa shows that the differ- 
ence pa — \a tends to 1 as Of t +00, thus illustrating that 
LRU discipline performs better than RND for large enough 
a (it can be formally shown that pa > A^ for all ol > 1). 
This difference diminishes, however, for smaller values of 
a since pa and \a behave similarly as a is close to 1 (see 
Figure [T|. Apart from that limited discrepancy, both disci- 
plines provide essentially similar performance levels in terms 
of miss probabilities for large cache sizes. 



5 6 
a 



Figure 1: Prefactors pa and \a with a > 1, for RND 
and LRU policies, respectively. 



In contrast to heavy-tailed popularity distributions, we can 
consider a light-tailed distribution where 

qr = Aexp{~Br'^), r > 1, 

with positive parameters A,B,I3. It is shown in this case 
[18| that the miss probability for LRU is asymptotic to 

M(C) ~ ^C'-^qc 

for large C. For a geometric popularity distribution (with 
/3 — 1), the latter estimate shows that M(C) — 0{qc)', on 
the other hand, formula ( |12[ ) of Corollary 3.4 shows that 
M{C) = 0{Cqc) for RND discipline. This illustrates the 
fact that RND and LRU replacements provide significantly 
different performance levels if the popularity distribution is 
highly concentrated on a relatively small number of objects. 

4. NUMERICAL RESULTS: SINGLE CACHE 

We here present numerical and simulation results to vali- 
date the preceding estimates for a single RND policy cache. 
In the following, when considering a finite object popula- 
tion with total size A^, the Zipf popularity distribution is 



normalized accordingly. We also mention that the content 
popularity distribution obviously refers to document classes 
instead of individual documents. For comparison purpose 
with the existing LRU analysis, we represent these classes 
by a single index, as if they were a single document. In the 
following, cache sizes must accordingly be scaled up to the 
typical class size. 

Simulations are performed using an ad-hoc simulator writ- 
ten in C. In every simulation, performance measures are col- 
lected after the transient phase, once the system has reached 
the stationary state. In this paper, transient behavior is not 
considered; note that the duration of the transient period 
obviously increases with the cache size. 

Besides, the most critical parameter in our simulation set- 
ting is the numerical value of a. As the Zipf distribu- 
tion flattens when a get closer to 1, much longer simula- 
tion runs are necessary to have good estimates of the miss 
rate. Small enough values of a must, nevertheless, be con- 
sidered as they are more realistic. Estimates of a are re- 
ported, in particular, in [24] for web sites providing ac- 
cess to video content like www.metacafe.com for which a = 
1.43, www.dailymotion.com and www.veoh.com for which 
a = 1.72 and a = 1.76, respectively. In the following, we 
hence fix a = 1.5 or q = 1.7 in our numerical experiments. 



Fig. |2 (a) [ first reports exact formula Q for M (C) = M{C; N) 
as a function of cache size C and for increasing values of to- 
tal population A'^, where M{C;N) measures the total miss 
probability for a cache of size C when the number of ob- 
jects A'^ is finite. As expected, the convergence speed of 
M (C; A^) to M (C; 00) as A' t +00 increases with a. In the 
case a — 1.5 for instance, a population of A' = 20 000 can be 
considered a good approximation for an infinite object popu- 
lation (A' — 00), whilst there is almost no difference between 
M{C; N ^ 20 000) and M(C; A" = 00) when a = 1.7. 



In Fig. 2(b) we compare exact formula ([3| for M{C), asymp- 
totic ([21[) and simulation results for the above scenario. For- 
mula ra for N = 00 is computed with arbitrary precision 
and we used A' = 20 000 for simulation as a good approx- 
imation for an infinite object population. Simulation and 
exact results are very close (especially for a — 1.7) while 
asymptotic ( [21| | gives a very good estimation of the miss 
probability as soon as C > 20. 



Fig. 2(c) presents the miss probability Mr{C) as a function 
of the object rank r for both RND and LRU policies with 
fixed C = 25 and a = 1.7. Results are reported for the 
most popular classes and confirm the asymptotic accuracy 
of estimate ( [23[ l for RND and the corresponding one for LRU 
policy [19] . Beside the good approximation provided by the 
asymptotics, it is important to remark that RND and LRU 
performance are very close when object rank r > 15, while 
there is a slight difference for the most popular objects (say 
r < 15). Moreover, comparing M{C — 25) for RND and 
LRU (respectively equal to 0.147 and 0.108), we observe 
only 4% less of miss probability using LRU with respect to 
RND policy. This may suggest RND as a good candidate for 
caches working at very high speed, where LRU may become 
too expensive in terms of computation due to its relative 
complexity. 
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Figure 2: Single cache results: (a) exact formula for 
M(C) with RND policy (b) asymptotic of M(C) with 
RND policy (c) asymptotic of Mr{C) with C = 25, 
a = 1.7 for RND and LRU policies. 



5. IN-NETWORK CACHE MODEL 

In order to generalize the single-cache model, networks of 
caches with various topologies can be considered. 



We first consider the tandem system defined as follows. Any 
request is addressed to a first cache jjl with size Ci; if it is 
not satisfied, it is addressed to a second cache [12 with size 
Ca: 

- if this request is satisfied at cache P, the object is copied 
to cache [jl, with replacement performed according to the 
RND discipline; 

- if this request is not satisfied at cache (j2, the object is 
retrieved from a repository server and copied in caches jjl 
AND tt2 according to the RND discipline. 

Note that this replacement scheme, hereafter denoted by 
IPC for In-Path Caching, ignores any collaboration be- 
tween the two caches and blindly copies objects in all caches 
along the path towards the requesting source. 
We now fix some notation and properties for this tandem 
model. Let Ri{t) G {1, 2, A'^} denote the object requested 
at cache (tl at time t; we still assume that variables Ri{t), 
t £ N, describe an IRM process with distribution defined 
by P(7?i = r) = qr, I < r < N. Denoting by Si{t) (resp. 
S2{t)) the state vector of cache (tl (resp. [12) at time t, the 
bivariate process (Si(i), S2(t))teN is easily shown to define a 
Markov process that, however, is not reversible. It is there- 
fore unlikely that an exact closed form for the stationary 
distribution of process (81,82) can be derived to evaluate 
the miss probability for the two-cache system. 
Alternatively, we here follow an approach based on the ap- 
proximation of the request process to cache [12. Let t^, 
n £ N, denote the successive instants when a miss occurs 
at first cache ftl, and R2{n) be the object corresponding to 
that miss event at time tn- First note that the common 
distribution of variables i?2(n) is the stationary distribu- 
tion (qj.(2))r6N introduced in Lemma [3. 3[ equation Q, with 
cache size C replaced by Ci. In the following, we will further 
assume that 

(H) the request process for cache (12 is an IRM, that is, all 
variables R2{n), n £ N, are independent with common 
distribution 

P(i?2 = r) = qr{2), reN. 

The simplifying assumption (H) neglects any correlation 
structure for the output process of cache jjl (that is, the 
input to cache jj2) produced by consecutive missed requests. 
Recall also that the tail of distribution (g,.(2))rgN, defined 
by ( |11[ ), is proportional to that of distribution (gr)reN. 

The latter 2-stage tandem model can be easily extended to a 
tandem network consisting in a series of K caches {K > 2) 
where any request dismissed at caches jjl,...,Jj^, £ > 1, is 
addressed to cache (j(^ -I- 1). As an immediate generaliza- 
tion of the IPC scheme, we assume that any requested doc- 
ument which experiences a miss at cache jjj, 1 < j < I, 
and an object hit at cache (j(f -f 1) is copied backwards at 
all downstream caches A request miss therefore 

corresponds to a miss event at each cache 1,2, Fur- 
thermore, assumption (H) is generalized by saying that any 
cache j|^ considered in isolation behaves as a single cache 
with IRM input produced by consecutive missed requests at 
cache j|(^ — 1). The size of cache j|^ is denoted by Ci. 



5.1 Line topology 



In the following, the "global" miss probability Mr(Ci, . . . , Ce) 
(resp. "local" miss probability M*(Ci, . . . , C^)) for request r 



at cache I is the miss probabihty for object r over all caches 
!,...,£ (resp. the miss probability for object r at cache £) so 
that 

e 

MriCi, ...,C,)^1[m:{Ci,..., C,) (27) 

i=i 

(note that for a single cache, we have Mr(Ci) = M*(Ci)). 
To simplify notation, we abusively write Mr{i) (resp. M*{£)) 
instead of (Ci, . . . ,Ce) (resp. instead of M* (Ci, . . . ,Ce)). 
Finally, ii qr{£), r > 1, defines the distribution of the in- 
put process at cache (t^, the averaged local miss probability 
M*{£) at cache ^£ is given by 

M*(/) = ^AC(%r(/) (28) 
for any £ € {1, ...,K}. 

Proposition 5.1. (See Proof in Appendix^ 

For the K -caches tandem system with IPC scheme, suppose 
that the request process at cache (jl is IRM with Zipf popu- 
larity distribution with exponent a > 1, and that assumption 
(H) holds for all caches jj2, 

For any £ G {1,...,K} and large cache sizes Ci,...,Ct, the 
global miss probability Mr(£) (resp. local miss probabihty 

M* {£) ) is given by 

e-i 

Per + Cj 

Mr{£) , M:i£) (29) 

Pc.r°'+J2C]' par-"+^C" 
j=i i=i 



time t. Requests are served according to the IPC rule, i.e., 
are forwarded upwards until the content is found, and the 
content is then copied in each cache between this location 
and the addressed leaf. 



Corollary 5.2. Consider a homogeneous tree with IPC 
scheme and suppose that assumption (H) holds for all its 
internal nodes. The results of Proposition \5.1\ then extend 
to that tree with IRM request process at leaves and Zipf pop- 
ularity distribution with exponent a > 1. 



Proof. Only the order of requests in time matters since 
their precise timing is irrelevant; we can consequently as- 
sume that the requests arrive according to a Poisson process 
with intensity 1. From the property of independent thinning 
and merging of Poisson processes, it follows that the requests 
for a given object r at leaf j is also a Poisson process of inten- 
sity PjQr, and that the request process at leaf j is a Poisson 
process with intensity Pj with a Zipf popularity distribution 
qr = A/r", r > 1. Now, using assumption (H) and apply- 
ing the previous results for a single cache to each leaf, we 
deduce that at any leaf j, the miss sequence for object r is 
a Poisson process with intensity pjqrM* (1). Merging these 
miss sequences from all children of a given second-level node, 
we deduce that the requests at this node follow a Poisson 
process and that the probability of request for an object r 
is qrM*{l)/M{l) = qr(2). This process has the same prop- 
erties as the IRM process with distribution (gr(2))rGN used 
in the proof of Proposition ! 5. 1| which therefore applies. Re- 
peating recursively this reasoning at each level, we conclude 
that Proposition |5.1| holds in this context □ 



Proposition |5.1| shows how the K-stage tandem system with 
IPC scheme improves the performance in terms of miss 
probability by adding a term Cf when the j-th cache is 
added to the path. From Propositions |5 . 1 1 and |3 . 9| it is read- 
ily derived that the average global miss probability M{£) for 
all objects requested along the cache network is 



Apa 



(30) 



i=i 



for any £ £ {1, K} and large cache sizes Ci, 



Ct. 



5.2 Tree topology 

The previous linear network model can be easily extended 
to the homogeneous tree topology with Zipf distributed re- 
quests. By homogeneous, we mean that all leaves of the 
tree are located at a common depth of the root, and that 
the cache size for each node at a given level i is equal to d 
(where Ci is the cache size of the leaves). An example of 
such a tree is a complete binary tree of given height. 

Let Ai, . . . , Aj be the J leaves of the tree. We assume that 
all requests arrive at the leaves, following an IRM, that is, 
P(ii(t) = r,A(t) = j) = pjqr for all 1 < r < TV, 1 < j < J, 
where (pi, . . . ,pj) are positive values such that Ei<j<j pj — 
1 and A(t) denotes the leaf where the request t arrives at 



Remark 5.1. Corollary \5.2\ is also valid for a homoge- 
neous tree where different replacement policies are used at 
different levels i (e.g. Random at first level and LRU at 
second one). 

6. NUMERICAL RESULTS: NETWORK OF 
CACHES 

We here report numerical and simulation results to show 
the accuracy of the approximations presented in previous 
Section [5] 



Fig. [3(a)1 first reports estimate ^ of M;(1) and M*{2) 
for both RND and LRU (the approximation for the tandem 
LRU are taken from [6]) with Ci = C2 = 25. We focus on 
the second cache, as the performance of the first one has 
been largely analyzed in previous sections. We note a good 
agreement between the approximations evaluated in Section 
[5] and simulation results. 

Moreover, while less popular objects are affected in the same 
way when employing either RND or LRU (in our specific 
example, r > 15), a significantly different behavior is de- 
tectable for popular objects (r < 15). Local miss probabili- 
ties M*(l) and M*(2) help understanding where an object 
has been cached, conditioned on its rank. The combination 
of LRU and IPC clearly tends to favor stationary configu- 
rations where popular objects are likely to be stored in the 



first cache (see also [6] for a similar discussion). When us- 
ing RND instead of LRU, however, the distribution of the 
content across the two caches is fairly different; in Fig. |3(a)| 
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Figure 3: Tandem cache results: (a) asymptotics of 
M;(1), M*.{2) (b) and of A/^(2) for RND and LRU 
policies compared to simulation with Ci = C2 = 25, 
a = 1.7 (c) Asymptotic for M(2) = M{Ci,C2) with 
Ci = C2 < 100, Q = 1.7. 
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for example, while the most popular objects are likely to be 
retrieved at the first cache when using either LRU or RND, 
only by using RND can such an object be also found in the 
second cache. It therefore appears that while both LRU and 
RND tend to store objects proportionally to their popular- 
ity, RND more evenly distributes objects across the whole 
path. 



Fig. 



3(b) reports the total miss probability at the second 

(i)m;(2), 



cache Mr{2) — M*(1)M*(2), i.e., the probability to query 
an object of rank r at the repository server. In this example, 
we see that objects with rank r < 15 are slightly more fre- 
quently requested at the server when using RND rather than 
LRU, but RND is more favorable than LRU for objects with 
higher rank r > 15. In average, the total miss probability at 
the second cache M(2), reported in Fig. 3(c) is very similar 
either using RND or LRU, with a slight advantage to LRU. 
M(2) indicates the amount of data that is to be requested 
from the server. 

With no claim of generality, we notice that the approxima- 
tions calculated in Section [5] for RND and in |5], ^ for LRU 
are very accurate. Furthermore, the approximations work 
well for a large number of tests that we do not report here 
because of lack of space. 



7. MIXTURE OF RND AND LRU 

So far, we have considered networks of caches where all 
caches use the RND replacement policy. In practice, it is 
feasible to use different replacement algorithms in the same 
network. This section addresses the case of a tandem net- 
work, where one cache uses the RND replacement algorithm 
while the other uses the LRU algorithm. As in Section [5. 2| 
these results also hold in the case of an homogeneous tree. 



7.1 Large cache size estimates 

We first provide estimates for miss probabilities in the case 
when cache sizes Ci and C2 are large. 



Proposition 7.1. For the 2-caches tandem system with 
IPC scheme, suppose the request process at cache (tl is IRM 
with Zipf popularity distribution with exponent a > 1 and 
that assumption (H) for cache ^2 holds. 
I) When cache [tl (resp. cache (|2^ uses the RND (resp. 
LRU) replacement policy, the global (resp. local) miss prob- 
ability Mr(2) (resp. M*(2)) on cache jJ2 is given by 



, , /„x Par" f PaC" 

' ~ p.r" + Cr V "A,, (p„r« + C?) J ' 



M;(2) ~ exp ( , 



(31) 



for large cache sizes Ci, C2 and constants pa, introduced 
m ^ and 

II) When cache [jl (resp. cache ^2) uses the LRU (resp. 
RND) replacement policy, the global (resp. local) miss prob- 



ability M,.(2) (resp. M*{2)) on cache (2 is given by 

Per" 



par exp 



Mr{2) ~ 

m;(2) ~ 

Par + C2 exp 
for large cache sizes Ci, C2- 



a\ar° 
Par"" 



aA„r" 



(32) 



Proof. We follow the same derivation pattern as the 
proof of Proposition [5?T] detailed in Appendix [E] 
I) When cache (jl uses the RND replacement policy, we know 
from Appendix |E] that the request process at cache [j2 is 
IRM with popularity distribution 



qr{2) = g, 



m;(i) 



r > 1, 



M*(l) C-}+Par° 

and is asymptotically Zipf for large r. We then follow the 
proof of Proposition 6.2 of [g]. Let 5*2(0, t) be the number of 
different objects requested at cache (12 in the time interval 
[G,t]\ it verifies 

E[52(0,t)] = ^(l-e-'-'^"). 



We then first deduce that 
E[S2(0,t)l > 



-<J,,(2)t 



Au. 



(33) 



Using the variable change v = C° ^t/{Ci + PaU°') in the 
latter integral, we further obtain 



E[5'2(0,t)] > 



t 



Av . 



Letting t f +00, the monotone convergence theorem for 
function v ^ {1 — Civ /t)^^^^^" together with a further in- 
tegration by parts yield 



lim 



t 



(34) 



Starting integral ( |33[ ) from u — instead of ti = 1, the latter 
asymptotic bound is seen to hold a lso as an upper bound of 
E [52(0, /t, thus showing that (34l actually holds as an 
equality. The local per-object miss rate on the second cache 
for an LRU cache is then 



M;(2) ~ exp 



~q.(2)C2" lim 
\ tt+00 



E[5'2(0,t)]'= 



which proves expressions (31 1 



II) When cache Jl applies the LRU replacement policy, the 
local per-object miss rate at cache Jl is known [19] to equal 



M*{1) ~ exp 



cr 



[r(i- 1^ 







— exp 





and that the local average miss rate is 



M*(l) 



r 1 



Using assumption (H), it then follows that the input process 
at cache jj2 is IRM with popularity distribution given by 



g,.(2) = q, 



m;{i) gi 

M*{1) ^ 



\ar°' 



exp 



r > 1. 



Note that this distribution is asymptotically Zipf for lar ge r , 
with coefficient A' (2) = A/M*{1). Applying estimate (25 I 
to the above defined distribution ^^(2), r > 1, it then follows 
that 

M:{2)^{l + qA2)e'i2))-\ 

where the associated root 6' (2 ) is e asily estimated by ^'(2) = 
C2 /A' {2)pa by using Lemma 3.8 We hence derive that 



m;(2) 



1 + 



c? 



am;(i) 



A'{2)pa r"M*(l) 
which leads to expressions (|32|) □ 



7.2 Numerical results 

We here report numerical and simulation results for mixed 
homogeneous tree topologies to show the accuracy of the 
approximations presented in previous section, in order to 
derive some more general considerations about the mixture 
of RND and LRU in a network of caches. According to 
Section [5.2| we simulate in fact tree topologies, with 2 leaves 
with cache size Ci and one root with cache size C2. 



Fig. 4(a) reports M*{2) for RND-LRU and LRU-RND ho- 



mogeneous tree networks with asymptotics (31 1 and (32 1, 
respectively. We first note that the latter provide estimates 
with reasonable accuracy. Besides, we observe that the be- 
havior of M*{2) is in strict relation to the policy used for 
cache jjl. If cache (tl is RND then, M*{2) has a behavior 
similar to that observed in RND caches tandem; similarly, 
if cache [jl is LRU, M*(2) behaves as in the case of LRU 
caches in tandem. 

This phenomenon has a natural explanation. RND and LRU 
act in the same way on objects ranked in the tail of the Zipf 
popularity distribution. However, the two replacement poli- 
cies manage popular objects in a rather different way, as we 
already observed in Section [5] The second level caches see a 
local popularity that is shaped, by the first level of caches, in 
the body of the probability distribution. The portion of the 
distribution that is affected by such shaping process is deter- 
mined by the cache size Ci at first level. In the analysis re- 
ported in Fig. [Ij the first level cache significantly determines 
the performance of the overall tandem system. In Fig. 4(b) 



we observe that the total miss probability Mr (2) in the two 
mixed tandem caches is similar, while the distribution of the 
objects across the two nodes varies considerably. 



Finally, in Fig. 4(c) we present simulation results of M{2) = 
M(Ci, C2) for all possible configurations in an homogeneous 
tree network in which Ci — C2 < 100. Observe that LRU- 
RND tree cache network achieve slightly better performance 
than the LRU-LRU system at least with Zipf shape parame- 
ter a — 1.7. This behaviour would suggest to prefer LRU at 
the first cache beacuse it performs better in terms of Miss 
probability and RND at the second one in order to save 
significant processing time. 



8. CONCLUSION 

The recent technological evolution of memory capacities, as 
illustrated by the deployment of CDNs and the proposition 
of new information-centric architectures where caching be- 
comes an intrinsic network property, raise new interests on 
caching studies. 




15 20 25 30 35 40 
Object rank r 

(a) 





1 r 


;=) 


0.8 








0.6 


o 




i-l 

Oh 




% 


0.4 




0.2 








RND - LRU 

LRU - RND 








Sim. 

Asymptotic 


□ 
o 

























































10 15 20 25 30 35 40 
Object rank r 
(b) 



0.25 



RND-RND 
LRU-LRU 
RND-LRU - 
LRU-RND - 




10 20 30 40 50 60 70 80 90 100 
Cache capacity [objects] 

(c) 

Figure 4: Mixed homogeneous tree cache results for 
mixed tree cache networks with Ci = 25, C2 = 50, 
a — 1.7: (a) asymptotic of M*(2) (b) asymptotic of 
Mr{2) (c) Simulation of M(2) = M{Ci,C2) with Ci = 
C2 < 100, Q = 1.7. 



In this paper, we have studied the RND replacement policy, 
where objects to be removed are chosen uniformly at ran- 
dom. Assuming that the content popularity follows a Zipf 
law with parameter a > 1, and that content requests are 
IRM, we prove that for large cache size C, the mis s rat e is 
asymptotically equivalent to ApaC^~°' (see Lemma 3.7 and 
Proposition 3.9 1. This shows that the difference between 



LRU and RND caches is independent of the cache size and 
only depends on coefficient Apa . These results are extended 
to typical network topologies, namely tandems and homo- 
geneous trees, under the assumption that requests are IRM 
at any node. The case of mixed policies (RND at one net- 
work level and LRU at the other one) is also considered. 
Simulations show that the IRM assumption applied to net- 
work topologies is reasonable and that provided estimates 
are accurate. 

Our results suggest that the performance of RND is rea- 
sonably close to that of LRU. As a consequence, RND is a 
good candidate for high-speed caching when the complexity 
of the replacement policy becomes critical. In the presence 
of a hierarchy of caches, caches at deep levels (i.e. access 
networks) typically serve a relatively small number of re- 
quests per second which can be easily sustained by a cache 
running LRU; LRU policy should consequently be imple- 
mented at the bottom level since it provides the best perfor- 
mance. Meanwhile, higher-level caches see many aggregated 
requests and should therefore use the RND policy which 
yields similar performance (as to second-level cache) while 
being less computationally expansive. 

We have assumed in this paper that the parameter a of the 
Zipf distribution is larger than 1, and the total number of 
available objects is infinite. As an object of further study, 
we first intend to explore the case where a < 1, with a finite 
number of objects. Besides, since Zipf popularity distribu- 
tions do not represent all types of Internet traffic, we also 
intend to analyze the performance of RND caches when the 
content requests follow a light-tailed (e.g. WeibuU) distri- 
bution. Finally, all results derived in this paper hold for 
i.i.d. content requests. Admittedly, actual traces show that 
requests are correlated, both in time and space. The def- 
inition of an accurate and realistic model which can take 
these correlations into account, as well as the extension of 
the present results to such a request model is also on our 
research agenda. 
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APPENDIX 

A. PROOF OF THEOREM |32] 

(i) Using equation ( |14[ ) reduces to g{z) = C where 
F'{z) _^ q,Z 



F(z) 



El 

j>i 



-qjZ 



(35) 



Gontinuous function g : z £ [0,+oo[ — >■ g{z) G [0,+C)o[ 
vanishes at z = 0, is strictly increasing on [0, -)-cxj[ and tends 
to +00 when z t 4-c>o. Equation (14 1 has consequently a 



unique positive solution 6c. Note that 6c tends to -foo 

with C since g{z) < Ej>i(gjz) — z, hence 6c > C. 

(ii) Gonsider the random variable Xc with distribution 



r{Xc = a;) = 



F{6c[ 



x>0. 



(36) 



where 6c satisfies (141; note that definition (36 1 for Xc is 
equivalent to 



G{x)=^^nXc = x), 



x> 0. 



(37) 



Using definition ( |36[ ), the generating function of random 
variable Xc is z i-^ F{z6c) / F{6c)\ in view of (14 1, the 
expectation of variable Xc is then 



E(Xc 



d F(z6i 
Az F{ec 



6c 



F'( 



F{ec 



C 



so that random variables Yc ~ {Xc — C)/VC, C > 0, are 
all centered. Besides, the Laplace transform of variable Yc 
is given by 



E(6 



,VcF{6ce 



F{6c) 



for all s £ C. By Levy's continuity theorem ( [27| , Theorem 
4.2.4), assumption (151 entails that variables Yc converge 



in distribution when C t +00 towards a centered Gaus- 
sian variable with variance a^; moreover, assumption (16 1 
ensures that the conditions of Chaganty-Sethuraman's theo- 
rem ([8], Theorem 4.1) hold so that P(Xc = C) = ¥{Yc = 0) 
is asymptotic to 

(38) 



r{Xc = C) ~ l/aV2^ 

as C t +00 ■ Equation ( |37| fo r x = C and asymptotic (38 1 
together provide estimate (|17p for G(C) □ 



B. PROOF OF LEMMA |3J] 

Let q{x) — A/x" and fz{x) = log(l -f (l(x)z) for any real 
a; > 1 and 2 £ C; definitions ([5|, ( |13| and the above notation 
then entail that 

logF(z) = ^/.(r); 

function log F is analytic in the domain C\R~ . For given z G 
C \ R~ and integer J > 1, the Euler-Maclaurin summation 
formula [I] reads 



where L is the finite integral [I] 
L ■ 









/ log 











sin(7r/a) 



ap, 



(43) 



with Pa introduced in (191 for a > 1, and where 

nl/(Az)l/» 



log 1 + 



dt ■ 



log{Az) 



+ 



+ o(l). 
(44) 

Gathering (43 1 and (441 provides expansion (42 1. Using the 
explicit expression of /i^', the dominated convergence the- 
orem finally shows that when z f +00, remainder in ( |41| ) 
tends to some finite constant ta depending on a only. Gath- 
ering terms in (40 1- (42 1, we are finally left with expansion 
a-|-a/12-|-tc«/6. Some further cal- 
alog(27r)/2, although this 



( 18 1 with constant Sa 
culations would provide Sa 
actual value does not intervene in our discussion □ 



C. PROOF OF PROPOSITION |3j] 

Recall definition ( |35| l of function g and write oquivalently 
5(2) = X^S^l'")- 



f4x)dx+-[MJ) + Mi)] + 



(39) 



with 



TAJ) 



B3{{x})fi^\x)dx 



where B^x) = x{x — l)(2a:: — l)/2 is the third Bernoulli 
polynomial and {x} denotes the fractional part of real x\ 
derivatives of fz are taken with respect to x. Consider the 
behavior of the r.h.s. of (39 1 as J tends to infinity. We first 



have fz{l) = log(l + Az) and /,(J) = 0(J~") for large J; 
differentiation entails 



aAz 
1 + Az 



and f'z{J) = 0(J~"~^) for large J. Differentiating twice 
again with respect to x shows that the third derivative of 
fz is 0{x~°'~^) for large positive x, and is consequently in- 
tegrable at infinity. Letting J tend to infinity in ( |39| l and 
using the above observations together with the boundedness 
of periodic function a: > 1 1— >■ Badx}), we obtain 



aAz 



Tz 



log Fiz) = fz (x)d.+ 2 log(l + A.) + ^^^^^ + ^ 

(40) 

where 



Tz = 



B3{{x})fi''>{x)d. 



(41) 



Now, considering the first integral in the r.h.s. of (|40|, the 
variable change x = t{Azy^°' gives 

fz{x)dx={Azy^-[L- J^^ log(^l + -jdtJ 

(42) 



L{Az[ 



l/a 



log(^2:) - a + 0(1) 



where we let gz{x) = Az{x°' + Az) ^ . The Euler-Maclaurin 
summation formula [l] applies again in the form 



r=l 1 



gzix)dx+- [gziJ)+gz{l)] + 



WzjJ) 

6 



(45) 



for given 2 G C \ K , integer J > 1 and where 



WziJ) I < 



12 

(2^ 



7(3) (x) I dx 



(derivatives of gz are taken with respect to variable x). Con- 
sider the behavior of the r.h.s. of (45 I as J tends to infin- 



ity. Firstly, ^^(1) = Az/{Az + 1) and g^(J) = 0(J"") as 
J t +00; secondly, g'z{l) = —Aaz{l + Az)~^ together with 
g'z{J) ~ 0(J~"~^) for large J. Differentiating twice again 
shows that the third derivative of gz is 0(a;~"~'^) for large 
positive X and is consequently integrable at infinity. Letting 
J tend to infinity in (|45[) therefore implies equality 



where 



gz{x)dx + 



Az 



- + - 



1 



Aaz 



Wz 

2Az + l ' 12(1 + Az)^'^ If "-^^^ 



\Wz\< 



12 

(2^ 



7?' (2;) i dx. 



Using the explicit expression of the derivative g 
simply shown that 



(3) 



it can be 



Wz 



0{z-''^"), I Wz I =0(z-'l 



Z), \ Wz \=0{Z-^) 

(47) 

if Q > 2, a = 2 and 1 < a < 2, respectively. Now, consider- 
ing the first integral in the r.h.s. of ( |46| l, the variable change 

X = t{Az)^^° 



L 



gives 
-00 

gz{x)dx = I{Azf^" 



1 + ( - 



(48) 



where I = Lj a — p]I", with integral L introduced in (43 1 



for a > 1. Expanding all terms in powers of z for large z, it 
therefore follows from Jlel) and Jisl that 



g{z)=pT{Azf''' ^\ + W. 

with Wz estimated in \A7\ . For large C, equation ( |35[ ) 
i.e.g{6c) ~ C, then reads 



j + ^i+OiWoa) 



(49) 



for a > 2 since ^ implies VKe^ = 0(6''^^°') = 0(C-2) 
in this case. The case 1 < a < 2 gives a similar expansion 
since the remainder is Wg^/C = 0(C-"/C) = 0(C-°-'). 
Finally, the case a — 2 yields 



A9c = 



C 



o 



logC 
C 



(50) 



Gathering results (|49|-([50f finally provides expansions ([20 
for Oc □ 



D. PROOF OF PROPOSITION |3j 

We here verify that conditions ( |15[ ) and ( |16[ ) of Theorem 
|3.6| are satisfied in the case of a Zipf popularity distribution 
with exponent a > 1. Let us first establish convergence 
result ( |15[ ). Using Lemma [3.7[ we readily calculate 



-s/s/C\ 



e — , — e exp 



F{ec) 



2VC 



(51) 



for any given s G C with 5R(s) — 0, | Si(s) | < a and where 
e{9) — > as 6* t +oo. By Lemma 3.8 we further obtain 
a{paA6c)^^°' ~ aC + apc,rc + o{rc) and the expansion of 



1 at first order in 1/C entails that 



yip^AOc)'^" (e""/"^ - l) = -sVC + + O 



s 

2a 



letting C tend to infinity, we then derive from (51 1 and the 
previous expansions that 



-s/^/C^ 



FiOc 



exp 



2a 



so that assumption (15 I is satisfied with — l/a. 

Let us finally verify boundedness condition ( 16 1. Rephrasing 

(51 1 for s = —iyyfC, we have 



Fide) 



i/c 



= exp 



[aip^AOcy/'' 



C 



(e'«/° - l) 



+ 



ejece^y) - e{6c) - 
C 



iy_ 
2C 

(52) 



for any y G R. But as above, a{paA9cY^°' /C tends to the 
constant a when C t +oo so that 

-\\ (53) 



F{ece 



Fide) 



<\ h{y) [ X exp ^ 



C 



for some positive constant /3 and where 



h{y) = |exp (^e*''/" ^ ^) \ ^ ''^P (°°^ (^) " ^) 



Function h is continuous, even and given 5 > 0, ft is decreas- 
ing on interval [S, tt] since a > 1, hence h{y) < h{d) = rjs < 1 
for 5 < y < -K. Using the estimates derived in Appendix [B| 
it is further verified that, given any compact C C not con- 
taining the origin, we have limct+oo e{9cu) = uniformly 
with respect to u £ K,; this entails that the exponential 
term in the right-hand side of (|53[) tends to 1 when C t +00 



uniformly with respect to u = e''' , j/ £ [5, tt] . We finally 
conclude that condition is verified □ 

E. PROOF OF PROPOSITION O 

Let qr{l + 1), r > 1, denote the distribution of the input 
process at cache £ > 1. By the same reasoning than 

that performed in Lemma [3. 3[ we can write 



M*{£) 



M;{1)...M*{1) 
' M* {£)... M*(l) 



(54) 



for all r G N, where M*{£) (resp. M*{£)) is the local miss 
probability of a request for object r at cache jj^ (resp. the 
averaged local miss probability for all objects requested at 
cache (jf) introduced in (28 1 and with notation qr = gr(l). 
As M*{£') — 5> 1 for all £' < £ when r t +00, we deduce from 
dm that 



qr{£+l) 



Ai£) 



when r t -^00, where A{£) = A/M* {1)M* {2)...M* (£). Ap- 
ply then estimate (251 to obtain 



m:{£+i) 



l + e(£+l)qr{£+l) 



(55) 



where e{£ -I- 1) ~ Ce+-^/A{£)pc and with qr{£ + 1) given 



by (54 1; using the value of A{£) above and the definition 
qr = A/r°', the product 6{£ + l)qr{£ + 1) reduces to 



^{£ + l)qr{£+l) 



A(£)p^ 
Pc.r°' 



m*{i)...m;{i) 

' M* {£)... M* (I) 



M* {£)... M*{1) 



(56) 



Writing then Mr{£+1) = Mr{£)M*{£ + l), asymptotics ([55l 
and (|56[) together yield 



Mr{£+1) 

so that 



M.r{£) 



e+1 
Per"' 



m;(^)...m;(i) 



+ 



'-'I 



Mr{l+l) Mr{£) Par 



since Ilj^iM* (j) = Mr{£) after (27 1; the latter recursion 
readily provides expression (29 1 for Mr{£), 1 < £ < K . 
Using relation Mr{£ -I- 1) = Mr{£)M*.{£ + 1) again together 
with expression ( 29 1 of Mr {£) provides in turn expression 

(29| for m;(^), 1 < ^ < a: □ 



